Back

BMC Medical Research Methodology

41 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
An E-value-Informed Sensitivity Analysis Framework for Hybrid Controlled Trials
2026-03-06 epidemiology 10.64898/2026.03.05.26347653
Top 0.4% (4.6%)
Show abstract

Hybrid controlled trials (HCTs) incorporate real-world data into randomized controlled trials (RCTs) by augmenting the internal control arm with patients receiving the same treatment in routine care. Beyond increasing power, HCTs may improve recruitment by supporting unequal randomization ratios that increase patient access to experimental treatments. However, HCT validity is threatened by bias from unmeasured confounding due to lack of randomization of external controls, leading to outcome non-...

2
Show Your Work: Verbatim Evidence Requirements and Automated Assessment for Large Language Models in Biomedical Text Processing
2026-03-04 health informatics 10.64898/2026.03.03.26346690
Top 0.4% (4.1%)
Show abstract

PurposeLarge language models (LLMs) are used for biomedical text processing, but individual decisions are often hard to audit. We evaluated whether enforcing a mechanically checkable "show your work" quote affects accuracy, stability, and verifiability for trial eligibility-scope classification from abstracts. MethodsWe used 200 oncology randomized controlled trials (2005 - 2023) and provided models with only the title and abstract. Trials were labeled with whether they allowed for the inclusio...

3
Evaluating a Locally Deployed 20-Billion Parameter Large Language Model for Automated Abstract Screening in Systematic Reviews
2026-03-04 health informatics 10.64898/2026.03.04.26347506
Top 0.8% (2.8%)
Show abstract

BackgroundSystematic reviews (SRs) are essential for evidence-based medicine but require extensive time and resources for abstract screening. Large language models (LLMs) offer potential for automating this process, yet concerns about data privacy, intellectual property protection, and reproducibility limit the use of cloud-based solutions in research settings. ObjectiveTo evaluate the performance of a locally deployed 20-billion parameter LLM for automated abstract screening in systematic revi...

4
Trustworthy personalized treatment selection: causal effect-trees and calibration in perioperative medicine
2026-03-04 health informatics 10.64898/2026.03.03.26347440
Top 1% (2.0%)
Show abstract

BackgroundPersonalized medicine promises to tailor treatments to the individual, but it carries a hidden risk: mistaking statistical noise for actionable clinical insight. Current machine learning approaches often provide predictions, but fail to inform clinicians when those predictions are unreliable. ObjectiveDevelop a deployment-readiness framework that integrates causal inference, interpretable effect-trees, and calibration assessment to distinguish actionable signal from unreliable variati...

5
Variability in Automated Sepsis Case Detection: A Systematic Analysis of Implementation Methods in Clinical Data Repositories
2026-03-04 health informatics 10.64898/2026.02.27.26347259
Top 1% (1.9%)
Show abstract

ObjectiveTo systematically identify and characterize methodological heterogeneity in sepsis case detection methods using the MIMIC-III database or the eICU-CRD, and to quantify the resulting variability in sepsis detection rates. Materials and MethodsWe conducted a PRISMA-guided systematic review of PubMed and Web of Science (2016-2024), and stratified studies by cohort definition to obtain comparable subsets. We extracted information on sepsis case detection methodology across six domains: par...

6
Perceptions of Artificial Intelligence in the Editorial and Peer Review Process: A Cross-Sectional Survey of Traditional, Complementary, and Integrative Medicine Journal Editors
2026-03-04 health informatics 10.64898/2026.03.04.26347571
Top 2% (1.9%)
Show abstract

BackgroundArtificial intelligence chatbots (AICs) are increasingly being integrated into scholarly publishing, with the potential to automate routine editorial tasks and streamline workflows. In traditional, complementary, and integrative medicine (TCIM) publishing, editorial and peer review processes can be particularly complex due to diverse methodologies and culturally embedded knowledge systems, presenting unique opportunities and challenges for AIC adoption. MethodsAn anonymous, online cro...

7
Class imbalance correction in artificial intelligence models leads to miscalibrated clinical predictions: a real-world evaluation
2026-03-05 health informatics 10.64898/2026.03.04.26347634
Top 2% (1.9%)
Show abstract

BackgroundPredictive models employing machine learning algorithms are increasingly being used in clinical decision making, and improperly calibrated models can result in systematic harm. We sought to investigate the impact of class imbalance correction, a commonly applied preprocessing step in machine learning model development, on calibration and modelled clinical decision making in a large real-world context. MethodsA histogram boosted gradient classifier was trained on a highly imbalanced na...

8
A Qualitative Study of Patient and Healthcare Provider Perspectives on Mobile Health Assessments for Cervical Spondylotic Myelopathy
2026-03-05 health informatics 10.64898/2026.03.04.26347622
Top 3% (1.4%)
Show abstract

Objective: Evaluating and monitoring patients with cervical spondylotic myelopathy (CSM) remains a challenge due to limited tools for assessing objective neurological disability longitudinally and in the home environment. Given their prevalence and low cost, mobile health (mHealth), and specifically smartphone technologies offer a promising approach to fill this gap. This study explored stakeholder perspectives on the role of mHealth in CSM monitoring to inform development of a smartphone-based ...

9
Enhancing competency in clinical trials management: Findings from a multicountry trial coordinators interventional training program
2026-03-04 medical education 10.64898/2026.03.03.26347517
Top 4% (1.4%)
Show abstract

BackgroundClinical research coordinators play a crucial role in ensuring the scientific rigor, regulatory compliance, and operational integrity of clinical trials. However, in Africa, they often lack access to structured, competency-based training, especially in operational, regulatory, and trial management domains. This study evaluated the effectiveness of a comprehensive training intervention designed to standardize and enhance core competencies of clinical trial coordinators. MethodsWe condu...

10
Thyroid Cancer Risk Prediction from Multimodal Datasets Using Large Language Model
2026-03-06 health informatics 10.64898/2026.03.05.26347766
Top 4% (1.3%)
Show abstract

Thyroid carcinoma is one of the most prevalent endocrine malignancies worldwide, and accurate preoperative differentiation between benign and malignant thyroid nodules remains clinically challenging. Diagnostic methods that medical practitioners use at present depend on their personal judgment to evaluate both imaging results and separate clinical tests, which creates inconsistency that leads to incorrect medical evaluations. The combination of radiological imaging with clinical information syst...

11
Effectiveness of new treatment modalities for localized prostate cancer through patient-reported outcome measures: 5 years comparative study.
2026-03-05 epidemiology 10.64898/2026.03.04.26347624
Top 5% (1.3%)
Show abstract

BackgroundNo randomized clinical trial comparing the most established new modalities of treatment for patients with localized prostate cancer has been published, and there is scarce comparative effectiveness research assessing Patient-Reported Outcome Measures (PROMs). Objectiveto compare the impact of active surveillance, robot-assisted radical prostatectomy (RARP), Intensity-modulated radiotherapy (IMRT), and real-time brachytherapy on patients, through PROMs, from pre-treatment to five years...

12
Medical concept understanding in large language models is fragmented
2026-03-05 health informatics 10.64898/2026.03.03.26347552
Top 5% (1.2%)
Show abstract

Large language models (LLMs) perform strongly across a wide range of medical applications, yet it remains unclear whether such success reflects genuine understanding of medical concepts. We present an ontology-grounded, concept-centered evaluation of medical concept understanding in LLMs. Using 6,252 phenotype concepts from Human Phenotype Ontology, we decompose concept understanding into three core dimensions--concept identity, concept hierarchy, and concept meaning--and design corresponding be...

13
Red-Teaming Medical AI: Systematic Adversarial Evaluation of LLM Safety Guardrails in Clinical Contexts
2026-03-05 health informatics 10.64898/2026.02.26.26347212
Top 6% (1.1%)
Show abstract

BackgroundLarge language models (LLMs) are increasingly deployed in medical contexts as patient-facing assistants, providing medication information, symptom triage, and health guidance. Understanding their robustness to adversarial inputs is critical for patient safety, as even a single safety failure can lead to adverse outcomes including severe harm or death. ObjectiveTo systematically evaluate the safety guardrails of state-of-the-art LLMs through adversarial red-teaming specifically designe...

14
Using the ECHILD Database to Explore Educational and Health Outcomes of Unaccompanied Asylum-Seeking Children living in England (2005 to 2021)
2026-03-04 health informatics 10.64898/2026.03.04.26347576
Top 6% (1.0%)
Show abstract

UK-based quantitative research on the health and education outcomes of Unaccompanied Asylum-Seeking Children (UASC) remains limited, especially at national level. Linked administrative data provide an unprecedented opportunity to study these outcomes among UASC. This paper lays a foundation for further research, particularly examining the influence of socio-demographic, legal and environmental factors on UASCs health and educational outcomes. We described the UASC population with a first record...

15
Population differences in wearable device wear time: Rescuing data to address biases and advance health equity
2026-03-06 health informatics 10.64898/2026.03.06.26347799
Top 6% (1.0%)
Show abstract

Wearable devices present transformative opportunities for personalized healthcare through continuous monitoring of digital biomarkers; however, individual variations in device wear time could mask or otherwise impact signal identification. Despite the widespread adoption of wearable devices in research, no comprehensive framework exists for understanding how wear time varies across populations or for addressing wear time-related biases in analysis. Using Fitbit data from 11,901 participants in t...

16
Insights from the second season of collaborative influenza forecasting in Italy with updated targets incorporating virological information
2026-03-04 epidemiology 10.64898/2026.03.04.26347601
Top 6% (1.0%)
Show abstract

We present results from the second season of Influcast, a multi-model collaborative forecasting hub focused on influenza in Italy. During the 2024/25 winter season, Influcast collected one-to four-week-ahead probabilistic forecasts of influenza-like illness (ILI) incidence alongside influenza A and B ILI+ incidence signals. New ILI+ targets were constructed integrating syndromic surveillance data with virological detections collected weekly by the Italian National Institute of Health. Forecasts ...

17
Personalized Insights Derived from Wearable Device Data and Large Language Models to Improve Well-Being
2026-03-04 health informatics 10.64898/2026.03.03.26347299
Top 7% (0.9%)
Show abstract

Health behaviors such as physical activity and sleep affect mental health, but the effect of each health behavior varies substantially across individuals, limiting the usefulness of generic behavioral recommendations. We collected one year of continuous wearable and ecological momentary assessment data from 3,139 participants in the Intern Health Study (2018-2023), and examined individual-level associations between wearable-derived features and mood across the internship year. The behaviors asso...

18
Enhancing Prediabetes Diagnosis from Continuous Glucose Monitoring Data via Iterative Label Cleaning and Deep Learning
2026-03-05 health informatics 10.64898/2026.03.04.26347604
Top 9% (0.7%)
Show abstract

As of early 2026, over 115 million US adults (more than 1 in 3) have prediabetes, a condition with an annual conversion rate of 5%-10% to type 2 diabetes. Total diabetes (diagnosed and undiagnosed) affects approximately 40.1 million Americans, or 12% of the population, with roughly 1.5 million new cases diagnosed annually. Continuous Glucose Monitoring (CGM) provides real-time, 24/7 insights into glycemic variability, detecting dangerous highs, lows, and trends that HbA1c (a 3-month average) mis...

19
The Impact of Neglecting Vaccine Unwillingness in Epidemiology Models
2026-03-06 epidemiology 10.64898/2026.03.05.26347735
Top 9% (0.7%)
Show abstract

With significant population fractions in many societies who refuse vaccines, it is important to reconsider how vaccination is incorporated into compartmental epidemiology models. It is still most common to apply the vaccination rate to the entire class of susceptibles, rather than to use the more realistic assumption that the vaccination rate function should depend only on the population of susceptibles who are willing and able to receive a vaccination. This study uses a simple generic disease m...

20
A bootstrap particle filter for viral Rt inference and forecasting using wastewater data
2026-03-06 epidemiology 10.64898/2026.03.06.26347747
Top 10% (0.7%)
Show abstract

Wastewater is increasingly being recognized as an important data stream that can contribute to infectious disease surveillance and forecasting. With this recognition, a growing number of statistical inference approaches are being developed to use wastewater data to provide quantitative insights into epidemiological dynamics. However, few existing approaches have allowed for systematic integration of data streams for inference, for example by combining case incidence data and/or serological data ...